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ABSTRACT 

The definitions of and literature on teat speededness 
are reviewed, and factors and areas in which the tests reveal their 
strengths and limitations are discussed. The focus of this article is 
on speeded reading con^irehension tests used for entry or re-entry 
placement in postsecondary institutions. Speeded tests were prompted 
by the accelerating popularity of timed tests initially used for mass 
assessment of World War I recruits and conscripts. Early research 
reviews dealt almost entirely with studies based on comprehensive 
student t^puJations. Later investigations l»gan to direct attention 
toward underp-^epared populations. Almost inevitably, students who 
score low on the reading test portion of an entry assessment are 
candidates for intervention. Issues covered include speed set and 
anxiety, test wiseness and accuracy, and backwash. Speed set is a 
rate-of-worJt mind set purposefully executed to complete a task during 
the allot^ed tiwe instead of the time the task demands. Backwash 
refers to the effect that a test of particular skill has on the 
acquisition of that skill. The preponderance of research indicates 
that the tests in question are inappropriate for comprehensive 
populations as well as developmental students. There are a number of 
acceptable alternatives to speeded comprehension tests. A 52-item 
list of references is included. (TJH) 
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A SLOW LOOK AT SPEEDED READING COMPREHENSION TESTS 

By Gene Kerstiens 

That placement testing is an experience facing nearly 
every entering freshman and many re-entry students is a veil 
established postsecondar> reality. The test or test battery 
typically includes a speeded reading comprehension component 
employing challenging reading passages that require exacting 
reading and thinking skills in order to answer an abundance of 
accompanying test Items. So numerous are the passages and items 
and so limited is the testing time that many students do not 
complete the test unless they resort to testvise, score-inflating 
skimming or random responding strategies. 

Almost inevitably, students who score low on the 
reading test portion of an assessment are candidates for 
intervention. Identified as developmental, remedial, 

underprepared, or otherwise, these students are counseled and/or 
required to participate in courses, programs or other treatments 
designed to remedy whatever reading deficiency or disorder the 
test can be interpreted to reveal. Having participated in a 
^ prescribed intervention and before gaining access to other 

programs or courses in the curriculum, these students once aqain 
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may be administered the same or another form of a speeded reading 
test. Unanimous research and expert opinion to the contrary, the 
sufficiently appreciated scores on this retest often are 
construed to predict competence in college-level reading skills. 
When we consider that at least half of the students evaluated in 
a given entering or re-entering college population can be 
identified as under prepared , the significance (or enormity) of 
the testing as well as the placement process takes on formidable 
proport ions . 

Addressing these matters, this investigation will 
explicate the definitions ot test speededness, review the 
professional literature on the topic, and discuss identifiable 
factors and areas in which the tests reveal their strengths and 
limitat ions . 



SPEEDED COMPREHENSION TESTS 

Recognized in the literature is the distinction between 
power and speed tests. Generally, reading comprehenion tests 
having no time limits are characterized as power tests. When 
time is imposed so that all participants do not finish the test, 
it is considered a speeded instrument. Therefore, the degree of 
speededness assigned to a given test is relative to the 
completion rate or near completion rate of a norming population. 

The Educational Testing Service "considers a test to be 
speeded if fewer than 100 percent of candidates reach 75 percent 
of the items and fewer than 80 percent of the candidates finish 
the test." (Davis, Kaiser, Boone 1987, p. 9). Discussing 



reasonable accommodation tor developmental testing c.andi«3dtes , 
Morante (1989) aiiserts that 90 percent of students j.hould 
complete all Items; Nunnally (1978), identifying a "comf ort.jbl e 
time limit" (p. 632), agrees. Studying developmental students' 
observed and stated behaviors as well as compared scores on nine 
standardized reading comprehension measures, Kerstiens (1986a and 
1986b) used the term time-cr itical to identify any timed reading 
test scored without correction for error whose time constraints 
are such that it establishes or requires test-taking speed set by 
(1) encouraging the employment of score-inflating testwise 
strategies, {?.) rewarding rapid responding at the expense of 
accuracy, or (3) eliciting a negative or interfering level of 
anxiety. 

Among these and other numerous operational definitions that 
have evolved, there is little close concurrence (Davis, Kaiser, 
Boone 1987). Nonetheless, that the majority of reading 
comprehension tests presently implemented in assessment programs 
would qualify as highly speeded even under a conservative 
definitional umbrella seems reasonable. 

REVIEW OF REVIEWS OF SEVENTY YEARS OF RESEARCH ON 
SPEEDED READING COMPREHENSION TESTS 

Early-on and prompted by the accelerating popularity of 
timed tests initially used for mass assessment of World War I 
recruits and conscripts. Gates (1921) conducted an extensive 
review of re:>earch of reading tests. He learned that on timed 



redding testa higher scorers were less arcurato, passages 
represented a too limited number of academic content areas, and 
that chance-success responses inflated scores. Corroborating 
Gates' findings, Flanagan's (1939) bibliographical search as veil 
as his own study also revealer' that higher scoring students were 
less accurate than their lower scoring peers, that indeed 
students' scores were inversely related to their accuracy of 
response. Likewise, Carlson's (1949) review concluded that timed 
reading comprehension testing's psychometric shortcomings 
included the condition that they penalized slower but more 
accurate readers. In 1958, I-etson reviewed nineteen studies 
stating that "speed of working" unfairly escalated scores, 
particularly on instruments presenting more difficult test items. 
Rankin's (1962) investigation cited numerous critics who found 
design flaws in time-critical tests of reading comprehension, and 
Davis (1962) discovered that the same tests, scored without 
correction for chance success, were invalid. Both of the writers 
agreed that post-test gains on timed reading tests were enjoyed 
chiefly by sacrificing accuracy of comprehension. Taking 
previous studies and reviews into account together with the 
findings of his own study, Tillman (1977), in a seemingly fin de 
giecle pronouncement, declared that "Ironically, the increasing 
popularity of certain tests seems to be inversely related to the 
negative comments of critics." (p. 253) 

While earlier research reviews dealt almost entirely with 
studies based upon comprehensive student populations, later 
investigations began to direct their attention to underprepar ed 
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populations. This area of i nvect igat i on apparently had gone 
unrecognized, having provoked insuf liclfnt dcadeinic curiosity to 
Inspire publishable research. Focusing on the experience of less 
academically adept students and citing ten relatively recent 
studies centered on the use of the most frequently employed test, 
Perkins (1984) strongly questioned its validity. She recommended 
"a re-examination of the use of the NDRT as an instrument which 
determines whether or not college freshmen possess adequate 
reading skills." (p. 67) Rounds' (1984) review echoed this 
dissatisfaction with the NDRT and other popular reading 
comprehension tests, finding them to bp poor assessment 
instruments as well as unreliable predictors of California 
community college course success. Kerstiens' (1986ab) studies 
and research reviews concluded that developmental students' 
scores on time-critical reading test measures were attributable 
largely to student rate of response, accuracy of response 
remaining at about thirty-five percent irrespective of score 
level. Fi.ially, a carefully focused investigation involving an 
underprepared Tennessee population together with an extensive 
research review (Davis, Kaiser, Boone 1987) recommended that time 
limits for the test in question be extended by thirty percent to 
assure fairer measurement. 

Truly, among the seven decades of studies and statements on 
the topic of comprehension test speededness, there are a handful 
of publications affirming that, given the properties of certain 
ipotruments and the level of the testing population, the negative 
effects of speededness are minimal, inconsequential or 
inoffensive. None of these champions the condition of timing. 



er|c 7 



vhni>e app.-jrent virtues are cidroin istrat ivr conven^enc^^ arKi Ui»* 
Mortiny of a talented reading elite. (Carver, 1984; Cumminr 1981; 
Raygor, 1978.) As shall be demonstrated, the psychometric, 
behavioral, affective, and curricular side effects of reading 
comprehension test speededness are substantial. 

SPEED SET AND ANXIETY 

Speed set is a rate-of-work mind set purposefully executed 
to complete a task during the allotted time instead of the time 
the task demands. Although virtually all testing consists of 
less than life-like samplings, time-critical tests effect a 
sprinted setting that is considered demonstrably artificial. 
(Bartin 1975; Rubenstein, Kender, Mace 1988) Nor do they address 
themselves precisely to assessing the basic abilities vhich 
facilitate a student's functioning while learning from 
challenging textual material. (McDonald 1966; Tuinman 1970) 
Especially for students given to deliberative reading, the 
urgency of the testing situation is n -t consistent with work 
requiring a study-reading approach rather than adroitly executed 
skim-and-respond strategies. (Rodriguez 1965; Townsend 1965) 
Dealing with difficult material, even the best readers retard 
rates in order to re- invest igate or reflect. (Carver 1985) But 
the typical underprepared student often thinks through silent 
speech (Pauk 1968; Feldman 1976) and routinely exercises re- 
reading behaviors while striving to comprehend. (Van Blerkom and 
Van Blerkom 1981) Also, the test items themselves often require 
more analysis, manipulation and interpretation than the reading 
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pactiager. they attend (Raygor 1978; Rtiyyor and Flippo 1971) •■ all 
of these tasks requiring careful £»v..iaiii9 .^fid soroe reflection if 
the intention is to comprehend accurately. 

Closely related to thr generic speed set issue is the 
phenomenon of anxiety. Virtually all writers agree that the 
imposition of time limits heightens anxiety in subjects and that 
the higher the speededness factor, the greater the anxiety. Wark 
and Bennett (1981) and Wark, Testenrud and Nelms (1972) studied 
the psychological as veil as physiological aspects of anxiousness 
under speeded reading test conditions to establish that higher- 
anxious students' scores are negatively affected. Cummins (1981) 
and Davis, Kaiser and Boone (1987) indicate that such over- 
anxious reactions inhibit concentration. In their Mianl-Dade 
study ( 1988), Schinoff and Steed surveyed and oh::erved 
developmental students to learn that they thought the speededness 
factor was undesirable. As early as 1951, Preston and Hotel 
declared that the speed set/anxiety factor rendered timed reading 
tests "invalid and untenable." (p. 71) 

TESTWISENESS AND ACCURACY 
"Testwiseness is the ability to correctly answer test 
questions o;. some basis other than knowledge that the questions 
were designed to measure." (Ferrell 1977, p. 138) Since the 
majority of timed reading comprehension tests employed in 
postsecondary assessment are scored without correction for error 
(Davis 1962), they are particularly vulnerable to score-inflating 
test-taking strategies. (Gates 1921; Kerstiens 1986ab; Morante 
1989; Townsend 1965) The more challenging the test's reading 



lUiV.Dnqvh and tcr.t items, the niurf conr.t ra i ned t}ie timtf limit., ar»d 
the tilower the rate ot respcnjse of the student, the more likely 
the testwise candidate will resort to cultivated guessing and 
random responding strategies. To the extent that studenttj h.jve 
knowledge of these strategies and are willing to exercise it, 
their test scores can be significantly enhanced, these practices 
diminishing the integrity of individual sctres and the validity 
of the instrument. Of course, non~testwlse students or those 
whose ethical predisposition prevents then, from exerting their 
testwiseness are penalized. 

That test speededness affects a student's rate and accuracy 
of response is well supported in the literature. (Carlson 1949; 
Cronan 3987; Gabriel an<= Richards 1988; Jolly 1985; Sadler and 
Whimbey 1985; Stetson 1981; Townsend 1965) And that students can 
learn or be taught guessing strategies to increase their scores 
is commonly acknowledged. But that students actually improve 
their reading comprehension having applied thecse strategies is 
questionable and undocumented. 

Therefore, writers express concern that encouraging test- 
taking guessing habits fosters certain inappropriate learning 
attitudes and behaviors: wild guessing (Gates 1921); answer 
rrabbing (Sadler and Whitnbey 1985); one-shot thinking (Gibson 
1986); chance success attitude (Davis 1962); and skim and guess 
syndrome. (Townsend 1965). Accordingly, strongly suggested in 
the literature is the notion that utilizing instruments that 
encourage these behaviors, and, further, drive the direction of 
the reading curriculum designed to inculcate effective reading 
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skills (1) encourage inappropriate habits and attitude?:, toward 
learning, (2) dli^courage the sharpening of critical reading -And 
thinking skills, and (3) essentially deter students; from the 
realities of effectively learning from their textbookc. 

BACKWASH 

"Backwash refers to the effect a test of particular skill 
has on the acquisition of that skill." (Backwash 1983, p. 1.) 
The backwash result can produce desirable or undesirable effects; 
undesirable conditions occur when there is a mismatch between 
measurement and instruction. (Popham 1978) Researchers concur 
that the fall-out of tiroe-crit ical reading comprehension testing 
is short of salutary. 

Besides having some unrewarding immediate influences on 

students' reading behaviors, our typical reading assessment 

practices may also provide students with unrealistic expectations 

and may influence practitioners to misdirect instruction. Since 

these tests do not present a fair sampling of a broad range of 

subject matter that represents a variety of acadv^mic disciplines 

{Morante 1989; Wood 1989), they cannot guage the scope and scale 

of stjdents' schena or prior knowledge that affect comprehension. 

(Durkin 1981; Hirsch 1987) Nor are the tests able to measure 

skills needed to cope with the vide range of difficulty 

encountered in assigned textbooks. (Bartin; Kurak 1967) Because 

A 

many of these tests are freighted with "bookish" (Raygor 1978, P. 
1210) passages and test items that appear to be calculatedly 
intimidating, they fail to "discriminate best among students of 
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low proficiency" (Morante 1989, P. 2) whose tjeed Ivr accutdte 
assessment is grciiteLt. Thus Gtudents as wt 11 up pr act 11 i onerj. 
ate presented with tests suggesting directions for skill 
attainment that are unrealistic and ultimately f r uct rat incj . 

But of particular concern of late Is this ter^t i ng ' backvach 
effect on the reading curriculum. Apprehension emerges from the 
notion that if testing is allowed to drive the reading curriculum 
"students will not learn the skills and abilities that they need 
to perform college reading tasks effectively." (Wood 1988, p. 
229) Wood (1989) is further disquieted by the expectation that 
if tests do not authentically reflect the demands of college 
reading assignments, they tend to impose "artificial limits" on 
the reading curriculum. 

The concern is hardly trivial. For in her celebrated and 
repeatedly cited research, Fairbanks (1972) (also see Farrell 
1975) specifies the reading skills whose mastery predicts success 
in college courses. We know that the academic achievement of 
college students improves when they learn to identify main ideas, 
distinguish between fact and opinion, and recognize and interpret 
inferences while managing a range of metacogni tive sub-skills 
enumerated by Mickler (1989), Any notion that the tests in 
question begin to evaluate these skills fairly is unreasonable. 
That college reading programs can prepare their under prepared 
students while focusing on the skills measured by the tests is 
unjust i fiable . 

DISCUSSION 

Of course, there are assessment alternatives to speeded 
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reading comp'-ehens i on tests. {All:n.3 1987- Schinoff ap<3 Steed 
1988) And given the preponderance of research indicating that 
the tests in question are inappropriate for comprehensive 
populations as well as developmental students, one wonders how 
the academic community can continue to ignore the findings with 
seemingly serene indifference. Perhaps the poignant question is 
How can those responsible for postsecondary assessment continue 
to sacrifice accuracy of measurement for administrative 
expedience? 

But that is the topic of another publication. 
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